Quick glance
Dr. Adeleke’s study data can be found on the GDx here.
Summary of data by state
| State | Sites |
|---|---|
| Abia | 26 |
| Adamawa | 30 |
| Akwa Ibom | 26 |
| Anambra | 13 |
| Bauchi | 20 |
| Bayelsa | 12 |
| Benue | 12 |
| Borno | 4 |
| Cross River | 18 |
| Delta | 23 |
| Ebonyi | 28 |
| Edo | 22 |
| Ekiti | 30 |
| Enugu | 17 |
| FCT | 20 |
| Gombe | 19 |
| Imo | 27 |
| Jigawa | 19 |
| Kaduna | 24 |
| Kano | 23 |
| Katsina | 15 |
| Kebbi | 8 |
| Kogi | 16 |
| Kwara | 13 |
| Lagos | 3 |
| Nasarawa | 10 |
| Niger | 14 |
| Ogun | 22 |
| Ondo | 23 |
| Osun | 32 |
| Oyo | 21 |
| Plateau | 19 |
| Rivers | 20 |
| Sokoto | 14 |
| Taraba | 15 |
| Yobe | 14 |
| Zamfara | 22 |
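
As a quick consistency check, the per-state counts above can be summed and compared with the row count reported at the end of data processing (694 rows). The sketch below copies the table into a dictionary. It assumes that each remaining row corresponds to one site, which the notes do not state explicitly.

```python
# Per-state site counts, copied from the table above.
site_counts = {
    "Abia": 26, "Adamawa": 30, "Akwa Ibom": 26, "Anambra": 13,
    "Bauchi": 20, "Bayelsa": 12, "Benue": 12, "Borno": 4,
    "Cross River": 18, "Delta": 23, "Ebonyi": 28, "Edo": 22,
    "Ekiti": 30, "Enugu": 17, "FCT": 20, "Gombe": 19,
    "Imo": 27, "Jigawa": 19, "Kaduna": 24, "Kano": 23,
    "Katsina": 15, "Kebbi": 8, "Kogi": 16, "Kwara": 13,
    "Lagos": 3, "Nasarawa": 10, "Niger": 14, "Ogun": 22,
    "Ondo": 23, "Osun": 32, "Oyo": 21, "Plateau": 19,
    "Rivers": 20, "Sokoto": 14, "Taraba": 15, "Yobe": 14,
    "Zamfara": 22,
}

# Assumption: one processed row per site, so the total should equal
# the final row count reported under "Data processing".
total_sites = sum(site_counts.values())
print(len(site_counts), total_sites)  # 37 694
```

The total of 694 matches the number of rows left after removing test rows and rows without community coordinates.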
Data processing
Starting dataset dimensions: 815 rows, 25 columns
- Remove ‘test’ rows
  - A row is treated as a test row when username = test, training_user, or state_1
  - Data dimensions after removing ‘test’ rows: 714 rows (101 rows removed)
- Prepare community coordinate data
  - The coordinates are stored as a single string of 4 values (lat, long, elevation, and precision), so split that string into 4 individual fields
  - Convert the 4 new fields from character to numeric
  - Remove rows missing lat/long for communities (20 rows)
  - Data dimensions after removing missing coordinates: 694 rows
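
The processing steps above can be sketched roughly as follows. This is a minimal illustration, not the script actually used. It assumes the rows are available as dictionaries, that the coordinate string in r_GPSCommunity is whitespace-separated, and that test usernames are matched exactly. The output field names (lat, long, elevation, precision) and the sample rows are hypothetical.

```python
# Usernames that mark a row as a test row (from the notes above).
TEST_USERNAMES = {"test", "training_user", "state_1"}


def parse_gps(value):
    """Split a 'lat long elevation precision' string into four floats.

    Returns None when the value is missing or malformed.
    Assumption: the four values are separated by whitespace.
    """
    if not value:
        return None
    parts = value.split()
    if len(parts) != 4:
        return None
    try:
        lat, lon, elevation, precision = (float(p) for p in parts)
    except ValueError:
        return None
    return {"lat": lat, "long": lon, "elevation": elevation, "precision": precision}


def process(rows):
    """Drop test rows, then drop rows without usable community coordinates."""
    kept = []
    for row in rows:
        if row.get("username") in TEST_USERNAMES:
            continue  # step 1: remove test rows
        gps = parse_gps(row.get("r_GPSCommunity"))
        if gps is None:
            continue  # step 2: remove rows missing lat/long
        kept.append({**row, **gps})  # add the four numeric fields
    return kept


# Hypothetical sample rows, for illustration only.
sample = [
    {"username": "test", "r_GPSCommunity": "9.05 7.49 476.0 4.5"},
    {"username": "collector_a", "r_GPSCommunity": "9.05 7.49 476.0 4.5"},
    {"username": "collector_b", "r_GPSCommunity": ""},
]
cleaned = process(sample)
print(len(cleaned))
```

Filtering test rows before parsing coordinates mirrors the order of the steps above, so the intermediate row counts (714, then 694) can be checked against the same sequence.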
Questions for Dr. Adeleke
- There are 2 fields with coordinates: r_GPSCommunity and r_GPS_river_basin. Were the coordinates in r_GPSCommunity taken at the actual fly collection site, or are these village centroids?
- Why are some rows missing community coordinates?
Questions for Dimagi
- Could Dimagi please filter out ‘test’ rows prior to posting data?
- Is there a consistent flag that identifies a ‘test’ row?